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Appl. No. 09/944, 1 63 PATENT 
Amdt. dated March 15, 2004 

Amendment under 37 CFR 1.116 Expedited Procedure 
Examining Group 

Claim 37(previously presented): A method in accordance with claim 29, wherein 
said method the progression of viral dissemination via a CMV-infected leukocyte is slowed. 

Claim 38 (previously presented): A method in accordance with claim 5, wherein said 
method the progression of viral dissemination via a CMV-infected leukocyte is slowed. 

Claim 39 (previously presented): A method of claim 29, wherein the chemokine is 
fractalkine. 

Claim 40 (previously presented): A method of claim 5, wherein the chemokine is 
fractalkine. 

REMARKS/ARGUMENTS 

Claims 5, 7-13, and 29-40 were previously pending and presented for 
examination. Claims 5, 8, 29, and 32 are amended herein. Claims 7 and 30 are canceled without 
prejudice. After entry of the amendments, claims 5, 8-13, 29, and 31-40 will be pending. 

Claims 5 and 29 stand rejected for an alleged want of enablement. 

Claims 5, 7-13, and 29-40 stand rejected for an alleged lack of nonobviousness. 

Applicants respectfully respond to the above rejections below. 

Amendments to the Claims 

Base claims 5 and 29 have been amended to recite the compound subject matter 
of their canceled dependent claims 7 and 30, respectively. In addition, the recital of a "small 
organic compound having a molecular weight of less than 800 daltons" has been deleted. 
Support for the above amendments is found inter alia in the previous versions of claims 5, 7, 29, 
and 30. 

Claims 8, 13 and 32 were amended to correct their dependencies in view of the 
cancellation of the intervening claims. 
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In view of the above, Applicants submit that the Amendments to the claims add 
no new matter and respectfully request their entry. 

Response to the Rejection of Claims 5 and 29 under 35 U.S.C. §112, first paragraph. 

The Examiner essentially found the scope of the compound subject matter of base 
claims 5 and 29 to be primarily described in terms of function and not structure and thus too 
broad to be enabled. Without acquiescing to the position of the Examiner and in order to 
expedite prosecution of the present application, the base claims 5 and 29 have therefore been 
amended to recite the compound subject matter of their respective dependent claims. 

Response to the Rejection of Claims 5, 7-13 and 29-40 as allegedly unpatentable under 35 
U.S.C. § 103(a) over Protiva et al. (U.S. Patent No. 4,243,805) in view of Merck Manual of 
Diagnosis and Therapy (17th Edition). 

Applicants further request the Examiner reconsider the propriety of the above 
rejection on the grounds that the inherency argument was improper. 

The Examiner alleged that it would be obvious to treat a CMV infection with a 
neuroleptic agent in so far as: 

1) Protiva shows that compounds for use according to the invention encompass 
useful neuroleptic and psychotropic agents. 

2) The Merck Manual indicates that CMV infection can cause CNS damage and 
injury, typically during neonatal exposure; and 

3) Michealson discloses that CMV infection can cause mental retardation and that 
CMV can replicate in a variety of cell types. 

The conclusion of the Examiner appears to require the following scenario: 

1) A person is posited who has a type of CNS disorder or CNS damage that 
is suited, rather than unsuited, for therapy with a neuroleptic agent. 

2) The CNS disorder or damage happened to be caused by a CMV infection; 
in this regard Applicants acknowledge that CMV infection can cause a large variety of 



Page 7 of 10 



Appl. No. 09/944, 1 63 PATENT 
Amdt. dated March 15, 2004 

Amendment under 37 CFR 1.116 Expedited Procedure 
Examining Group 

neurological diseases (see enclosed article by van dan Pol et al., J. of Neuroscience 19 
(24):10948-10965 (1999)). 

3) A clinician happened to select a neuroleptic as set forth in the claims 
rather than another neuroleptic agent or an agent of another class altogether to treat the subject 
condition; and 

4) The CMV infection is still present and active so that viral dissemination is 
that viral dissemination can be inhibited and the method of the invention thereby unwittingly 
accomplished. 

The above rejection is based upon a theory of inherency. However, Federal 
Circuit decisions have repeatedly held that an inherent feature or result must be consistent, 
necessary, and inevitable, and not the mere possibility as set forth above 1 . As the rejection is 
improper, Applicants respectfully request that it be reconsidered and withdrawn. 



Response to the Rejection of Claims 5, 7-13 and 29-40 as allegedly unpatentable under 35 
U.S.C. §103(a) over Sindehar et al. in view of Merck Manual of Diagnosis and Therapy (17th 
Edition) and Michelson Eur. (Cytokine Netw 10(2^:286-7 (1999). al. 

With respect to the neuroleptic activity of octoclothepin and methiothepin, the 

above discussion of the disclosures of the Merck Manual and Michelson reference and a proper 

theory of inherency apply with equal force. As noted above, an inherent feature or result must be 

consistent, necessary, and inevitable, and not a mere possibility. 



1 See, e.g., Mehl/Biophile International Corp. v. Milgraum. 192 F.3d 1362. 52 USPQ2d 1303 (Fed. Cir. 1999): Atlas 
Powder Co. v. Ireco Inc.. 190F.3d 1342. 1347, 51 USPQ2d 1943. 1946 (Fed. Cir. 1999) ("To invalidate a patent by 
anticipation, a prior art reference normally needs to disclose each and every limitation of the claim, ... However, a 
prior art reference may anticipate when the claim limitation or limitations not expressly found in that reference are 
nonetheless inherent in it. ... Under the principles of inherency, if the prior art necessarily functions in accordance 
with, or includes, the claimed limitations, it anticipates."); Abbott Laboratories v. Geneva Pharmaceuticals, Inc., 182 
F.3d 1315, 51 USPQ2d 1307 (Fed. Cir. 1999), cert denied, 528 U.S. 1078(2000), In re Robertson, 169 F.3d 743, 
745, 49 USPQ2d 1949, 1950-51 (Fed. Cir. 1999) ("If the prior art reference does not expressly set forth a particular 
element of the claim, that reference still may anticipate if that element is 'inherent 1 in its disclosure. To establish 
inherency, the extrinsic evidence 'must make clear that the missing descriptive matter is necessarily present in the 
thing described in the reference, and that it would be so recognized by persons of ordinary skill. 1 ... 'Inherency, 
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A Showing of Surprising or Unexpected Results Rebuts the Prima Facie Case 

The Action rightly pointed out that a surprising result can negate a prima facie 
case of obviousness. The Applicants point out that the specification teaches that administration 
of the subject compounds can inhibit CMV dissemination and are useful in treating CMV 
infection. With respect to the instant nonobviousness determination, Applicants submit that one 
of ordinary skill in the art would simply not have expected neuroleptic agents to also have 
antiviral, anti-CMV activity. 

It is not mere attorney argument to point out the importance of the amino acid 
primary sequence to protein binding 2 , and that therefore it is entirely surprising to find an agent 
that binds to a mammalian protein neurotransmitter receptor also binds to a wholly different viral 
protein receptor. Neuroleptic agents typically bind receptors for neurotransmitters (e.g., the 
dopamine receptor for octoclothepin, the 5-HT receptor for methiothepin; see enclosed Sigma 
Catalog and Merck edition summaries). The fact that such compounds as recited in the claims 
also bind to the CMV US 28 receptor is certainly surprising. No a priori rationale has been set 
forth upon which such a surprising result could be deemed expected for these compounds. And 
any generally applicable rationale would be inconsistent with the Action's stated grounds for 
rejecting the enablement of the base claims for lack of structural definition. Nor, is there any 
requirement that a specification set forth a side-by-side comparison of the inventive method with 
the prior art method to support a surprising result. As to the alleged absence of any evidence of 
the surprising result, Applicants point out that specification teaches and exemplifies in Examples 
2 and 3 compounds with such US 28 receptor binding properties. Applicants point out that the 



however, may not be established by probabilities or possibilities. The mere fact that a certain thing may result from a 
given set of circumstances is not sufficient.' "); 

While it is known that many amino acid substitutions are possible in any given protein, the position within the 
protein's sequence where such amino acid substitutions can be made with reasonable expectation of success are 
limited. Certain positions in the sequence are critical to the three-dimensional structure/function relationship, and 
these regions can tolerate only conservative substitutions or no substitutions. Residues that are directly involved in 
protein functions such as binding will certainly be among the most conserved (Bowie et al. Science, 247:1306-1310, 
1990, p. 1306, col. 2 (enclosed)). 
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instant Action has implicitly acknowledged the methods claims reciting the compounds of 
Formula I are enabled. 



In view of the above, Applicants also request that both the above grounds for 



In view of the foregoing, Applicants believe all claims now pending in this 



Application are in condition for allowance and an action to that end is respectfully requested. 



If the Examiner believes a telephone conference would expedite prosecution of 



this application, please telephone the undersigned at 925-472-5000. 



TOWNSEND and TOWNSEND and CREW LLP 

Two Embarcadero Center, Eighth Floor 

San Francisco, California 941 1 1-3834 

Tel: 925-472-5000 

Fax:415-576-0300 

Attachments 

FJM:fjm 

60165328 v1 



rejection be reconsidered and withdrawn. 



CONCLUSION 



Respectfully submitted, 



Frank J. Mycroft 
Reg. No. 46,946 
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Cytomegalovirus (CMV) infects a majority of adult humans. 
During early development and in the immunocompromised 
adult, CMV causes neurological deficits. We used recombinant 
murine cytomegalovirus (mCMV) expressing either green fluo- 
rescent protein (GFP) or j3-galactosidase under control of hu- 
man elongation factor 1 promoter or CMV immediate early-1 
promoter as reporter genes for infected brain cells. In vivo and 
in vitro studies revealed that neurons and glial cells supported 
strong reporter gene expression after CMV exposure. Brain 
cultures selectively enriched in either glia or neurons supported 
viral replication, leading to process degeneration and cell death 
within 2 d of viral exposure. In addition, endothelial cells, ta- 
nycytes, radial glia, ependymal cells, microglia, and cells from 
the meninges and choroid were infected. Although mCMV 
showed no absolute brain cell preference, relative cell prefer- 



Cytomegaloviruses (CMVs) are a widespread group of double- 
stranded DNA viruses that infect many different mammals in a 
species-specific manner. Human cytomegalovirus (hCMV) is 
commonly found in humans where virus distribution among 
adults ranges from 50 to 90% of the population (Mocarski, 1996; 
Johnson, 1998). Similarly, murine CMV (mCMV) is ubiquitous 
in wild mice. Both CMVs have large genomes consisting of —230 
kilobase pairs (kbp) exhibiting low level nucleotide sequence 
homology outside the iel region transcriptional enhancer and 
genome packaging signals but retaining colinear genome organi- 
zation. Although similar in virion morphology, effect on cells, 
pathogenesis, and biology, all CMVs exhibit striking species 
specificity (Osborn, 1982; Ho, 1991; Mocarski, 1996). 

CMV is the leading viral cause of congenital birth defects 
(White and Fenner, 1994; Alford and Britt, 1996). CMV can 
cause substantial brain damage when infection occurs during 
early human development, resulting in epilepsy, microencephaly, 
microgyria, hydrocephalus, deafness, and decreased IQ (Bray et 
al., 1981; Bale et al., 1985; Hicks et al., 1993; Perez-Jiminez et al., 
1998). CMV infections are found in —1% of human births, and 
clear neurological damage is found in 10% of those infected 
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ences were detected. Radial glia cells play an important role in 
guiding migrating neurons; these were viral targets in the de- 
veloping brain, suggesting that cortical problems including mi- 
crogyria that are a consequence of CMV may be caused by 
compromised radial glia. Although CMV is a species-specific 
virus, recombinant mCMV entered and expressed reporter 
genes in both rat and human brain cells, suggesting that mCMV 
might serve as a vector for gene transfer into brain cells of 
non-murine species. GFP expression was sufficiently strong 
that long axons, dendrites, and their associated spines were 
readily detected in both living and fixed tissue, indicating that 
mCMV reporter gene constructs may be useful for labeling 
neurons and their pathways. 

Key words: virus; neuron; GFP; development; mouse CMV; 
gene therapy; neuropathology 



(Hicks et al., 1993) with a higher percentage of more subtle 
neurological damage probable (Johnson, 1998). The range of cell 
types that are involved remains poorly defined (Ho, 1991). 

In contrast to early development,, children and adults control 
infection with little apparent effect on the nervous system, in part 
because of an effective immune response that maintains the virus 
in a lifelong latency. In immunocompromised individuals, CMV 
can contribute to neurological deficits and mental disorders (Na- 
via et al., 1986; Chimelli et al., 1992; d'Arminio Monforte et al., 
1992; Fiala et al., 1993; Arribas et al., 1995). Increased levels of 
virus in the brain are often associated with later stages of HIV 
infection (Kalayjian et al., 1993), sometimes with cellular colo- 
calization of CMV and HIV (Nelson et al., 1988). 

Microgyri found in cases of congenital disease (Diezel, 1954) 
have been suggested to be caused by CMV-mediated deficits in 
blood supply possibly related to endothelial cell infection (Tsut- 
sui, 1995) rather than a direct damage to neurons (Marques-Dias 
et al., 1984). Others have suggested that the primary cell involved 
is a glial cell, either microglia as studied in guinea pig (Booss et 
al., 1988) or possibly a monocyte-derived macrophage/microglia 
as described in humans (Pulliam, 1991). Neuron and astrocyte 
involvement has also been suggested in developing rodents (Tsut- 
sui, 1995) and HIV-infected humans (Wiley et al., 1986; Wiley 
and Nelson, 1988). In many cases, a single cell type has been 
suggested as the primary substrate for infection, with little de- 
tectable involvement of other cell types. 

The species specificity of this group of viruses (Mocarski, 1996) 
suggests that rodent CMVs, replication incompetent in human 
cells (Ho, 1991), might serve as vectors to introduce foreign genes 
into brain or brain cells. To this end we tested the hypothesis that 
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Figure 1. Map of the Hindlll K, L, and J fragments of murine CMV (K181+ strain). The murine CMV ie\/ie2/ie3 transcriptional enhancer and the 
arrangement of /el, ie2, ie3,sggl, and MCK transcripts are also shown, with splicing patterns indicated on the arrows depicting individual transcripts as 
they appear in wild-type murine CMV. The lacZ insertion mutation in RM427+, the EGFP-puro insertion in RM4503-1, and the GFP-GPT insertion 
in MC.55 are depicted below the map. Expression of the lacZ gene in RM427+ was regulated by a 199 bp human CMV ielfiel promoter (aHCM V) 
fragment (-219 to -19 relative to the transcription start site) (Manning et al., 1992). Expression of the EGFP-puro gene in RM4503 was regulated by 
a 249 bp human CM V ielliel promoter (-242 to + 7 relative to the transcription start site). These promoters display immediate early expression kinetics 
when placed adjacent to the murine CMV enhancer (Enh) in the Hpal sites of the iel gene (Manning et al., 1992; our unpublished observations). 
Expression of EGFP in MC.55 was regulated by the human elongation factor la promoter (EF-Ia) (Uetsuki et al., 1989), whereas the E. coli GPT was 
under the control of the mouse phosphoglycerate kinase (PGK) promoter. 



mCMV would serve to introduce foreign genes into mouse, rat, 
and human brain cells. Although a previous analysis had sug- 
gested that neither human embryonic brain cells nor human cell 
lines were susceptible to mCMV entry or infection (Kim and 
Carp, 1971), more recent advances in molecular biology of re- 
combinant viruses and reporter gene induction provide a more 
sensitive assay, as described here. We generated mCMVs ex- 
pressing an enhanced green fluorescent protein (GFP) or 
p-galactosidase (jS-gal) to study CMV infection and reporter gene 
expression in vitro and in vivo in live and fixed brain cells. 

MATERIALS AND METHODS 

Recombinant CMV. Recombinant viruses derived from the K181 strain of 
mCMV capable of expressing /3-gal (RM427+) or enhanced GFP 
(EGFP) (MC.55 and RM4503) were used in these studies (Fig. 1). 
RM4503, like RM427 (Manning et al., 1992), was constructed to express 
a marker gene insert under control of a chimeric promoter-enhancer 
composed of an hCMV promoter-enhancer fragment (-242 to +7 
relative to the transcription start site) inserted adjacent to the mCMV 
enhancer in the mCMV genome. This construct included an SV40- 
derived polyadenylation signal downstream of the marker gene and 
directed constitutive high-level expression of the marker throughout 
infection (Manning et al., 1992; Saederup et al., 2000). The EGFP-puro 
construct, a gift of Mark Prichard and Greg Pari (Hybridon Corporation, 
Cambridge, MA), fuses the entire EGFP protein coding sequence to the 
puromycin resistance protein coding sequence with an intervening five 
amino acid spacer. The expression cassette was inserted into the mCM V 
id gene, which has been shown to be completely dispensable for viral 
growth in cell culture as well as for growth, latency, and pathogenesis in 
BALB/c mice (Cardin et al., 1995). pON4503 was constructed by digest- 
ing pON4457, which carries a 6.6 kbp Oral /EcoRI fragment of mCMV 
DNA (nucleotide 183086-189674), with Hpal, cleaving at two closely 
spaced sites in the ie2 promoter and cloning a 1.7 kbp SnaBl/Hpal 
fragment from EGFP-puro in the same transcriptional orientation as ie2 
(Fig, 1). Recombinant virus was created by co-transfection of NIH 3T3 
cells with RM427+ DNA and Pacl/Afill digested pON4503 with Super- 
feet (Life Technologies, Gaithersburg, MD) in OptiMEM (Life Tech- 



nologies) followed by addition of standard DM EM (Life Technologies) 
supplemented with 10% NuSerum (Fisher Scientific, Pittsburgh, PA) and 
antibiotics as described (Vieira et al., 1994). Recombinant viruses were 
isolated by plaque assay from supernatants collected at 7 d after trans- 
fection. Recombinant virus was enriched by replacing standard medium 
with medium containing puromycin (5 jLtg/ml) (Sigma, St. Louis, MO) for 
the final 2 d and during one additional round of growth after which 
—80% of plaques exhibited green fluorescence. Enriched pools were 
used to inoculate BALB/c mice by the intraperitoneal route; progeny 
virus was recovered from salivary glands 14 d after inoculation and was 
followed by two rounds of limiting dilution purification at which time 
RM4503 was judged pure and authentic by the absence of any 0-gal- 
positive virus and by DNA blot hybridization of viral DNA restriction 
fragments. Green fluorescence was detectable as early as 6 hr after 
infection with plaque-pure RM4503 in NIH 3T3 cells. RM4503 grew as 
well as wild type in cell culture and, after intraperitoneal inoculation, 
reached titers similar to wild type in salivary glands. 

A second recombinant mCMV (Fig. 1, MC55) containing the GFP 
gene was constructed using pON488, which is composed of the Oral 
(position 183086) to Clal (position 188569) fragment of mouse CMV 
(Rawlinson et al., 1996) containing the immediate early-2 gene cloned 
into pUC2l, which had been cut with Pstl (made blunt with Klenow 
fragment and all four deoxynucleotides) and Clal. A cassette consisting 
of the EGFP gene (Clontech, Palo Alto CA) under the control of human 
elongation factor la promoter (Uetsuki et al., 1989), and the Escherichia 
coli guanosine-hypoxanthine phosphoribosyltransferase gene (GPT) un- 
der the control of the mouse phosphoglycerate kinase (PGK) promoter, 
was inserted between the Hpal (position 184236) and Nhel (position 
187119) sites of pON488 to create pQ55. For the transfection of NIH3T3 
cells, pQ55 was digested with Sail, extracted with phenol/chloroform 
(24:1) and chloroform, ethanol-precipitated, dried, and suspended in 
STE (5 mM NaCl, 5 niM Tris-HCI, pH 7.5, 1 mM EDTA) at a concen- 
tration of 1-2 mg/ml. For electroporation, 2 X 10* to 5 X 10* NIH 3T3 
cells plus 20 mg DNA in 0.4 ml of electroporation buffer [a 1:3 mixture 
of OptiMEM I (Life Technologies) and cytomix (van der Hoff et al., 
1992) (120 mM KC1, 0.15 mM CaC12, 10 mM K 2 HP04/KH 2 P0 4 , p H7.6, 
5 mM MgCl 2 )] were electroporated with a BTX ECM 600 instrument set 
at 275 V and 1075 mF in a 4 mm cuvette at room temperature. Cells were 
plated after electroporation and infected with mCMV at an multiplicity 
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figt/re 2 Glia infection in vitro. A, After 30 hr, most astrocytes in vitro 
were green, indicating infection with mCMV. Scale bar, 8 jim. 5, By 54 
hr, most glia were dead or dying and had a round featureless shape. Scale 
bar, 25 /xm. C-E, Olfactory ensheathing cells (olf.ex.) became green 30 hr 
after infection with CM V. Scale bar, 12 p,m. 



of infection (MOI) of 2-5 at 18 hr after electroporation. Progeny virus 
from the transfection/infection cultures were grown for two rounds under 
selection for the GPTgene as described (Vieira et al., 1994) to enrich for 
recombinant virus. Recombinant virus, identified by green fluorescent 
plaques under 488 nm illumination, were ultimately purified by twofold 
limiting dilution. In this MC.55 recombinant virus the iel gene, which is 
dispensable for growth (Cardin et al., 1995), is disrupted. Viability of 
mCMV was expressed as plaque-forming units (pfu) determined before 
the virus was frozen in stock solutions at -80° for future use. 

Viral replication. If virus replicates in the brain cultures, then as time 
progresses, the culture supernatant should show an increased concentra- 
tion of infectious virus. To test this, rat (Sprague Dawley) neurons from 
hippocampus, hypothalamus, and cortex were mixed and plated in 35- 
mm-diameter wells at a density of 10 6 cells/well. Parallel experiments 
were performed with mouse (BALB/c) neurons. Each condition was 
performed in triplicate. Brain cultures were infected with either of two 
different GFP-expressing viruses. The MC.55 virus was used at 10 ft pfu, 
and RM4503 was used at 5 X 10 s pfu. At 6 hr after inoculation, brain 
cultures were washed four times to remove the unadsorbed inoculum and 
placed in 2 ml growth medium. At 18 hr after the initial infection, all 
tissue culture medium supernatant (2 ml) was collected and replaced with 
2 ml sterile culture medium. Culture medium contained DM EM supple- 
mented with 10% fetal bovine serum (Hyclone, Logan, UT). Additional 
collections were made at successive 12 hr intervals. Tissue culture super- 
natant containing CMV was frozen within minutes after collection and 
kept at -80°C. 

To determine relative viral concentration, NIH 3T3 cells were grown 
on 12-mm-round coverslips until the cells were 60% confluent- Then a 
500 jul aliquot from each of the test conditions was added to the 3T3 cells. 
After 18 hr, all 3T3 cells were killed with 3% paraformaldehyde over- 
night After washing in phosphate buffer. 3T3 cells were examined in an 
inverted Nikon epifluorescent microscope using a 20 x Olympus objec- 
tive. Green fluorescent 3T3 cells were counted in eight visual fields on 
each 3T3-containing coverslip. 

Because CNS cultures contain a mixture of glia and neurons, in a 
second set of experiments we used more selective cultures enriched in 
glia or neurons to infect with MC.55, as described above. Glia were 
obtained from postnatal day (P) 5 mouse cortex. Glia were plated at low 
density and then allowed to replicate until they were 75% confluent in a 
35 mm culture dish. Neurons were obtained from a P6 cerebellum. These 
cultures consist primarily of cerebellar granule cells, neurons that un- 
dergo their final mitosis at approximately this stage of development, 
substantially later than most neurons. A more detailed description of 
granule cell (Liljelund et al., 1994) and astrocyte (van den Pol et al., 
1992) enriched cultures is found elsewhere. Granule cells were also 
plated at 75% confluency. Both cell types in these experiments were 
maintained with Neurobasal media (Life Technologies) for 4 d before 
viral contact. To avoid glial proliferation after viral infection, fetal 
bovine serum was not used during those 4 d or thereafter. Medium was 
harvested and replenished every 12 hr, as described above, and experi- 
ments were performed in duplicate. 

Virus in CNS. To determine whether mCM V would infect brain cells 
in vivo, 500 nl-1.5 jil (10 3 pfu/nl) of virus was injected with a Hamilton 
microsyringe directly into the brain. Mice from PI to adults were used. 
Adult rats also received intracerebral injections at the same virus con- 
centration. Injections were aimed so that the tip of the syringe needle 
would pass through the cerebral cortex and end in the striatum, hip- 
pocampus, or hypothalamus. To reduce a sudden increase in local pres- 
sure, the virus was injected slowly over 2-3 min. 

Microscopy. Photomicrographs were taken on an Olympus 1X70 in- 
verted fluorescent microscope with a Spot 2 digital camera (Diagnostic 
Instruments) interfaced with a Macintosh computer. Contrast was ad- 
justed with Adobe Photoshop, and images were printed on a Kodak 8650 
dye sublimation digital printer. 

RESULTS 

Viral infection of neurons and glia in vitro 

To examine the course of CMV infection of brain cells, develop- 
ing brains were cultured on glass coverslips and studied after 
incubation with GFP-expressing mCMV (either RM4503 or 
MC.55) or /3-gal expressing mCM V. Within 6 hr of viral addition 
to the culture (MOI = 15), some neurons and glia began to show 
GFP-mediated green fluorescence or 0-gal staining with each of 
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Figure 3. Mouse neurons in vitro: brief infection. A, A montage of a neuron in vitro for 5 d, 2 d after mCM V infection. Processes are labeled. Scale bar, 
22 jim. 5, A neuron with wide dendrites and a thin axon-like process in box. C, Higher magnification of box in B, showing a bright GFP-expressing axonal 
process growing on top of the dendrite. D, Axonal arbor 2 d after mCM V introduction. 



the three viruses used In cultures containing both glial cells and 
neurons, a greater number of astrocytes showed GFP labeling 
than did neurons at earlier time points. Within the next 24 hr, a 
greater number of cells showed expression of the reporter genes. 
In parallel, the expression of both GFP and 0-gal became much 
stronger. In cultures containing roughly equal numbers of astro- 
cytes and neurons, 96 astrocytes were labeled and only four 
neurons were found after 18 hr. The ratio changed over time, with 
greater numbers of neurons showing evidence of infection. 
Mouse brain cultures contained a mixture of neurons, astrocytes, 



and oligodendrocytes, as identified with immunolabeling for LI 
(neurons), glial fibrillary acidic protein (astrocytes), and myelin 
basic protein (oligodendrocytes) antisera, as described previously 
(van den Pol et al., 1992; van den Pol and Kim, 1993; Liljelund et 
al., 1994). 

Some cultures of hypothalamus, cortex, and hippocampus (n - 
6 each) were infected with mCM V at an MOI of 15. Interestingly, 
all cells, including neurons, astrocytes, and oligodendrocytes, 
died within 48 hr. This finding was based both on fluorescent 
microscopy and the use of video-enhanced differential interfer- 
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Figure 4. Mouse neurons in vitro: extended infection. A, Four days after 
infection axons show a greater degree of beading than seen in control 
cultures. B, C, Some axons in late stages of degeneration show only a few 
poorly connected swellings that are fluorescent. Scale bar, 10 /u.m. D, Two 
somata are seen with unusual round shapes and are devoid of processes 
5 d after infection. Scale bar, 12 /u,m. 

ence contrast (DIC) microscopy used to detect nonfluorescent 
cells. Ethidium homodimer (Molecular Probes, Eugene, OR), 
which labels the nuclei of dead cells, was also used to assess cell 
viability. In cultures made from postnatal day 2 glial cells from 
hippocampus, by 30 hr after infection, most astrocytes that had a 
typical sheet-like flat appearance were green, indicating infection 
with mCMV (Fig. 2A). After 48 hr, cells either were detached 
and floating or broken down with no cellular continuity; the cells 
still attached to the substrate were bright green and round, 
exhibiting viral cytopathic effects (Fig. 2B). In contrast, nonin- 
fected control cultures generated at the same time showed little 
evidence of cell death, suggesting that all types of brain cells can 
be killed by a high concentration of virus. When a lower dose of 
virus (MOI = 1) was used, cells survived much longer, with some 
neurons and glia still viable after 7 d. With time, however, all 
mouse brain cells died, whereas same-age noninfected controls 



remained viable. Although all infected cells expressed the re- 
porter genes, consistent with infection, the level of expression 
varied with time and dose of virus. The highest level of reporter 
gene expression came with longer survival times achieved with 
lower MOIs. This may have been because cells infected with high 
MOIs (>14) died before the higher levels of gene expression 
could be achieved. 

Earlier work on virus penetration into the brain suggested that 
the olfactory mucosa might be a potential portal of entry for virus 
into the brain by spread along the olfactory nerve as it entered the 
olfactory bulb. A primary cell type that encircles the olfactory 
axons is the olfactory ensheathing cell, a type of glial cell. 
Cultures of mouse olfactory ensheathing cells showed strong GFP 
expression after infection with mCMV (Fig. 2C-E). That the 
cultured cells were primarily ensheathing cells was confirmed 
with immunostaining for P75, the low-affinity NGF receptor that 
these cells selectively express (Vickland et al., 1991; Ramon- 
Cueto and Valverde, 1995). 

In control cultures, and in early stages of viral infection, neu- 
rons showed long processes that included both dendrites several 
hundred micrometers in length and axons exceeding 1000 jam in 
length. After mCMV infection, neurons initially began to turn 
green with no obvious sign of pathology (Fig. 3). However, soon 
after the neurites showed a green coloration, they began showing 
unusual dilations and constrictions in processes, some reaching 
several micrometers in diameter. Green processes were only 
found after the parent cell body showed reporter gene expression 
indicative of viral infection and gene expression. At a later stage 
of infection, processes began to break down and become strongly 
segmented (Fig. 4). At late stages of infection, parts of green 
processes could be found without connection to the cell body of 
origin (Fig. 4), and round cells devoid of processes were common 
(Fig. 4). These process dilations were not found in noninfected 
control cultures. With mCMV MOI of 15, the time course for 
neuronal degeneration was complete within 2 d, as determined in 
>20 cultures; with lower MOI (= 1), the time course could be 
extended. 

During later stages of infection, some groups of cells appeared 
to fuse together and had the appearance of giant multinucleated 
cells. These giant "cells" had diameters from 40 to 60 fxm, with 
large ones approaching 200 ftm. The largest of these appeared to 
be composed of >30 cells or nuclei (Fig. 5C, D). These cells 
showed continuous strong green fluorescence, suggesting that the 
outer membrane was intact. No giant cells were found that were 
not green, underlining the importance of viral infection and 
subsequent GFP expression. At later stages of infection, and after 
cell lysis, GFP was liberated into the culture medium, and the 
remaining cellular debris showed only faint green fluorescence. 
Giant multinucleated cells were not found in any control cultures. 
In culture dishes that contained giant cells, other groups of cells 
were found where the individual cells were not contained with a 
single membrane and therefore were not considered giant cells 
(Fig. 5A y B). Parallel detection of giant cells after CMV infection 
in humans has been reported (Belec et al., 1990; Horn et al., 
1992). The presence of giant cells suggests that CMV mediated 
fusion of the outer plasma membranes of different cells. 

Viral replication 

To assess viral replication in neurons and glia, tissue culture 
supernatant was taken at intervals after initial infection of brain 
cell cultures and complete washout of mCMV. The relative 
density of viral production was assessed by incubation of the 
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Figure 5. Giant cells after mCM V infection. DIC images are shown on the left, and on the right are shown the same field with fluorescent microscopy 
for detecting GFP.v4, B, Three days after infection of a mixed neuronal -glial culture, a small group of cells are tightly packed together but do not show 
characteristics of a giant cell. C, D, A giant cell appears to be composed of many other cells fused together and is seen with a few cells attached on the 
outside to the left. In C, the DIC image shows a relatively smooth outer membrane (arrows). Within the giant cell, a heterogeneous GFP-mediated 
fluorescence is seen, suggesting that there is not total cytoplasmic continuity at this point. Scale bar, 16 /xm. 



extracted media with a second set of cultures containing NIH 
3T3 fibroblasts, which are a fully permissive cell line for plaque 
assay and virus propagation. GFP-labeled 3T3 cells were then 
counted after 18 hr. 3T3 cells were used here because they are 
a common permissive host cell for raising CMV. The number 
of green 3T3 cells incubated with virus for a fixed period (18 
hr) increased with time (Fig. 6). In the initial test 18 hr after 
virus introduction into brain cell cultures, little evidence of 
virus was found in the brain culture supernatant. As the 
infection proceeded, the relative number of viral progeny that 
were produced increased dramatically, as deduced from the 
proportion of 3T3 cells that became green. Eight circular fields 
of 500 ftm diameter were counted per 3T3-containing cover- 
slips, which had a total area of 113 X 10 6 fim 2 . 

Viral replication of two GFP-expressing mCMVs (RM4503 
and MC.55) was compared in rat and mouse brain cultures (Fig. 
7). When a three-way ANOVA was used to test for statistical 
differences between conditions (virus X brain species X survival 
after virus infection), a significant difference was found (p < 
0.001). To make more discrete comparisons, a Tukey test was 



used. This revealed that each of the viruses used (RM4503 and 
MC.55) showed an increase in number over time (p < 0.05), with 
a slow initial rise time, followed by substantial increases in virus. 
Although the rate of replication was substantially lower in rat 
cells than in mouse cells (p < 0.05), viral replication was ob- 
served in rat brain cultures infected with the MC.55. However, 
little GFP was expressed in 3T3 cells incubated with tissue culture 
supernatant from rat cultures infected with a second virus, 
RM4503. This may be attributable to strain differences between 
RM4503 and MC.55. It could also be attributable in part to a 
lower general level of fluorescence found with RM4503, reducing 
detection in infected cells, or to a lower initial concentration of 
virus. 

The cultures described above contained a mixture of glia and 
neurons. To examine viral proliferation in more selective cul- 
tures, glial-enriched cultures were compared with cultures en- 
riched in cerebellar granule cell neurons (Fig. 8), each growing at 
roughly 75% confluency, and tested with neuron- and astrocyte- 
specific antisera, LI and GFAP, respectively. After inoculation 
with MC.55 (10 6 pfu per dish), the glia cultures showed a more 
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Figure 6. 3T3 test cultures used for viral replication test.^4, DIC image of N I H 3T3 culture incubated with tissue culture supernatant from brain cultures 
that had been infected for 18 hr. Arrow identifies a single cell. 5, Same field as in/4, showing a single fluorescent 3T3 cell (arrow). C, In parallel cultures, 
DIC imaging shows a group of 3T3 cells, one identified with an arrow to facilitate recognition in DIC and fluorescence. D, With fluorescent microscopy 
of the same field, most of the cells express GFP-mediated fluorescence. Tissue culture supernatant was taken from brain cultures 42 hr after mCM V 
infection. In both sets of experiments, 3T3 cells were examined 18 hr after their initial incubation with brain culture supernatant. Scale bar, 25 ^tm. 



rapid cell death than occurred in the neuronal cultures (Fig. 9). 
Analysis of virus replication showed that culture supernatant had 
low levels of virus during the first 18 hr, but then began to rapidly 
increase. By 42 hr, sufficient virus was produced to infect almost 
all of the 3T3 test cells. By 54 hr, approximately half of the glial 
cells showed signs of lethal infection, determined by cell swelling, 
detachment, and loss of cell number. At that time, most of the 
cells began to detach from the substrate, some even retaining high 
levels of GFP expression, suggesting that the plasma membrane 
was still intact. Because less than half of the astrocytes remained 
after 54 hr, samples were not tested further for release of virus 
progeny. 

In contrast, the neuronal-enriched cultures showed a substan- 
tial delay of reporter gene expression with —25% of the neurons 
showing detectable GFP after 54 hr, a time when all glial cells 
were infected and more than half the glia were dead. However, 
with time the number of neurons showing GFP expression in- 
creased. After 3 d of CMV infection, both infected and nonin- 
fected control cultures of granule cells showed strong neuritic 
extension. These were primarily composed of long bundles of 



granule cell axons and characteristically short granule cell den- 
drites. Interestingly, 5 d after mCMV infection, little evidence 
of neurites was found in the infected culture, whereas control 
cultures showed vigorous neuritic outgrowth, suggesting that 
mCM V caused a profound reduction in neurites (Fig. 8). Cells in 
the neuronal enriched cultures survived for a longer time than 
cells in the glial enriched cultures. Analysis of viral replication 
in 3T3 cells showed that 102 hr (4 d) after initial infection, 
sufficient virus was produced to turn all 3T3 test cells green. The 
rate of viral replication was significantly slower in neuronal cul- 
tures than found in glial cultures prepared at the same time 
(ANOVA, p < 0.01 ). 

Viral infection of mouse brain in vivo 

mCMV was microinjected at a single site in neonatal or adult 
mouse brains. After fixation and sectioning, the presence of 
mCM V was detected by the presence of either fluorescent GFP 
or x-gal staining in the presence of /3-galactosidase. Peroxidase 
immunostaining with antisera against GFP was also used as a 
sensitive means of detection. Of nine adult mice injected (5 x 10 5 
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MCMV Replication In Cultured Brain Cells 



A 




MCMV Collection (Hours post Infection) 

Figure 7. mCMV replication enhanced in mouse brain cultures. A, 
Using MC.55 that was engineered to express GFP under control of 
elongation factor 1 promoter, viral replication in brain cell cultures was 
studied by harvesting culture supernatant and determining the number of 
3T3 cells that showed GFP expression. Mouse brain culture supernatant 
generated strong GFP label in an increasing number of 3T3 cells, to the 
point that all 3T3 cells in culture were labeled with medium from cultures 
3 d after mCMV infection. Rat cultures showed a low level of viral 
replication. Forty-eight thousand 3T3 cells were plated per coverslip; the 
plateau between the last two mouse points was not attributable to a 
decrease in viral replication but rather to the fact that the maximum 
number of test 3T3 cells (48,000) had turned green. 5, Parallel experi- 
ments were performed with the mCMV under control of the human 
CMV promoter (RM4503). Replication was found in mouse brain cul- 
tures but at a lower rate than found with MC.55. With this viral strain, 
replication in rat brain cultures as deduced by GFP expression in 3T3 cells 
was minimal. 

pfu, 500 nl), none died in the 1-6 d after the injection. In 
contrast, of 17 mice injected at PI or P2, nine (53%) died in the 
following 4 d. mCMV injections into P3 mice were lethal to 4 
(40%) of 10 mice by 6 d after infection. 

Many different cell types in the brain showed evidence of viral 
infection. Robust expression of j3-gal and GFP was seen in the 
meninges (Fig. l()A,B) and choroid plexus (Fig. 10F). Some of 
the ependymal cells lining the third and lateral ventricles also 
were j3-gal or GFP positive (Fig. 10D,£). A recent report has 
suggested that ventricular ependymal cells may serve a role as 
neuronal stem cells (Johansson et al., 1999). If CM V causes loss 
of these cells, this may reduce potential neurogenesis, exacerbat- 
ing problems in the developing brain. Endothelial cells lining the 
blood vessels showed indication of viral presence, particularly in 
the region of the injection (Fig. 10C). In addition, virus-infected 



endothelial cells were found at a considerable distance from the 
injection site, suggesting spread of the virus through the 
vasculature. 

In older mice, in the third ventricle, infected cells included 
tanycytes, long radial glia that are present in both developing and 
mature brain with the cell body in the wall of the ventricle. In 
developing mice, dramatic expression of GFP was found in radial 
glia in many brain areas, particularly near a site of injection. 
Radial glia serve as important cellular guides for neuronal migra- 
tion during early development (Rakic, 1978). In the cerebral 
cortex, label was found in cell bodies of the radial glia in the 
deeper regions of the cortex and in the long processes that 
stretched outward, dividing into small endfeet at the pial surface 
of the cortex surface (Fig. \\A,B). Some infected radial glia 
showed signs of degeneration, characterized by abnormal swelling 
along the long glial process in both the inner (Fig.llC) and outer 
(Fig. 11D) cortex. These process dilations appear similar to 
process degeneration found in infected cultures. Although neu- 
rons and astrocytes greatly outnumber radial glial in general 
numbers in normal mice, in some developing mice (<5 d old) 
these radial glia accounted for most of the cells infected by 
mCMV in affected areas of the cortex. 

Some intracerebral injections of virus were made into different 
brain regions, including cortex, hippocampus, striatum, and hy- 
pothalamus. Particularly striking was the strong presence of virus 
in glial cells with the appearance of astroglia around the injection 
site. In addition, neurons were infected with virus; these cells had 
dendritic and axonal processes typical of neurons (Fig. 12), A 
large number of cases were found in many parts of the brain 
where neurons appeared to be the primary cell infected. Isolated 
neurons were found in the cortex (Fig. VIA), hypothalamus (Fig. 
125), and hippocampus (Fig. 12C). Of potential importance, 
some GFP-expressing neurons were found at considerable dis- 
tances from the injection site. This could have been caused in part 
by diffusion of the virus at the time of injection. However, in a 
number of cases, a random distribution pattern of the virus in 
distal regions of the brain was not found. Instead, a GFP-filied 
process could be followed from the labeled cell into the zone of 
brain damage elicited by penetration of the injecting microsyringe 
needle. This suggests that the virus may be transported within the 
neuron process back to the cell body where the virus entered the 
nucleus to express GFP. Many labeled axons were found far from 
the injection site (Fig. 12D-F), including in the white matter of 
the corpus callosum (Fig. 12£). 

Similar to neurons in culture, neurons in vivo that showed GFP 
expression several days after viral infection showed severe sym- 
metrical dilations and constrictions along long axons. Some of the 
dilations reached 5 or 6 fim in diameter; such large dilations were 
not seen in noninfected control axons. This type of dilation was 
also not found in control brain tissue labeled with Dil or horse- 
radish peroxidase injections, suggesting that it was not an artifact 
of fixation or histological processing. 

In developing mouse brains, intracerebral injections of mCM V 
can cause strong labeling of hippocampus. As shown in Figure 
13/1, cells surrounding the neonatal hippocampus show strong 
viral infection. Within the hippocampus, neurons and their pro- 
cesses are infected with the virus (Fig. 135). 

We found a number of cells that were clearly infected with 
mCMV, detected with expression of the viral GFP, but the 
infected cell showed little evidence of cytomegaly, the character- 
istic pathological determinant of CMV infection. This was true 
both in cultured cells and in histological sections from the brain. 
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Figure 8. Cerebellar granule neurons. A, Four days after mCM V (MC55) infection of a culture enriched with granule cells, many of the cells show 
bright GFP fluorescence. The same cell is indicated by an arrow in^-c' to facilitate recognition. B, Same field as in A but with partial fluorescence, partial 
phase contrast. Some cells seen in phase contrast are not fluorescent. C, Same field as in A but only with phase contrast. Scale bar, 100 /im. A higher 
magnification of C is shown in c* (arrow). Neurites are not found in these infected neurons. D, Control granule cell culture not infected with mCM V 
shows no fluorescence. E, Phase-contrast photomicrograph of culture 2 d after infection. Neurites are commonly found spreading out from groups of 
neurons, shown in higher magnification in e\ 
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MCMV Replication: Astrocyte vs Neuron 




MCMV Collection (Hours post Infection) 

Figure 9. mCMV replication in cultures enriched in astrocytes or neu- 
rons. Cultures enriched in cerebellar granule cells (neurons, •) or astro- 
cytes (O) were infected with MC.55. Viral replication was more rapid in 
the cultures enriched in astrocytes. Neuronal cultures also showed viral 
replication, but at a slower rate. Forty-eight thousand 3T3 cells were 
plated per coverslip. 



Thus some of the earlier reports suggesting a selective lack of 
infection of one type or another in the brain may simply be due 
to the absence of cytomegaly and the subsequent inability to 
detect infected cells. Histopathological detection of cytomegaly 
in tissues is insensitive but in the past has been used widely to 
search for evidence of CM V infection. The use of cytopathology 
as a marker for CMV infection at the cellular level is further 
complicated by the finding that induction of apoptosis in nonin- 
fected neurons can be generated by a secondary signal from 
CM V-infected cells in mice (Kosugi et al., 1998). The use of 
reporter genes as described here appears to be a more sensitive 
mode of assessment of mCMV presence than the cytomegaly 
commonly associated with CMV. Unlike approaches based on 
histological processing that may also be sensitive, for instance in 
situ hybridization (Myerson et al., 1984; Slobedman and Mocar- 
ski, 1999), our approach, particularly with GFP, does not require 
fixation or histochemical treatment. 

Gene transfer to rat and human 

The experiments with mouse brain cells suggested that mCMV 
might be a potentially useful vector for gene transfer into mouse 
brain cells. Given the potential toxicity of the virus to mouse cells, 
the virus would probably not be ideal for any beneficial action. 
However, because replication of CMVs is to a large degree 
species specific, we sought to determine whether CMV could be 
used for gene transfer to other species. To this end, we used rat 
and human cells in vitro and rat brain in vivo. 

Rat brain 

After mCM V infection, cultures of rat brain showed GFP expres- 
sion in both neurons and astrocytes. Microglia, identified by 
immunolabeling with an antibody against rat CDllb/c (Cedar- 
lane Labs, Hornby, Ontario, Canada) that binds to brain micro- 
glia and macrophages (Robinson et al., 1986), also showed GFP 
expression. There was also a low level of viral replication in rat 
brain cultures, as described earlier. 

In an in vivo set of experiments, mCMV was directly injected 
into rat brains (n = 12). After periods ranging from 1 to 7 d, 
animals (eight adult rats, four P5 rats) appeared healthy; no rat 



death was attributed to the mouse CMV. At or near the site of 
mCMV injection, strongly labeled neurons were found in hypo- 
thalamus, hippocampus, striatum, and cerebral cortex (Fig. 
14A,B,D,E). In the example in Figure 14A y BJD, adult striatal 
neurons showed very strong GFP labeling with little detectable 
GFP in the surrounding glial cells. Single axons and bundles of 
long axons could be followed from the striatal area several thou- 
sand micrometers toward the midbrain (Fig. 14C), and local axons 
were abundant around labeled cells. Spiny neurons in the striatum 
could be identified by the strong labeling in the dendritic spines 
(Fig. 14B). Despite the strong neuronal labeling in the striatum, 
relatively few cells with the morphology of glial cells could be 
found. Although the injection site included both cortex and stri- 
atum, the level of neuronal labeling in the striatum was dramat- 
ically stronger than in the cortex, suggesting some preferential 
infection of the virus. 

In its role as a vehicle for gene transfer, recombinant mCMV 
that expresses GFP may prove useful for some aspects of explor- 
ing neuronal circuitry and cell structure. This would not be 
limited to mouse, because we found strong labeling in rat brain 
and cultured human brain cells, suggesting that mCMV might 
also work in other species. Previous work with pseudorabies virus 
(Card et al., 1990, 1991, 1999), herpes simplex virus (Ho et al., 
1995; Meier et al., 1998), and Antiviruses based on HIV vectors 
with CMV promoters (Blomer et al., 1997; Miyoshi et al., 1997) 
has demonstrated the effectiveness of viral vectors in the brain. 

Human brain 

Although earlier studies suggested that mCMV did not infect 
human cells (Kim and Carp, 1971), in our hands human cells 
cultured from a brain astrocytoma showed intense green labeling 
after introduction of MC.55 (MOI = 2), indicating robust expres- 
sion of the transgene under control of the human elongation 
factor la promoter (Fig. 15A-C). Most of the human cells that 
showed infection on the basis of GFP expression showed little 
evidence of cytomegaly. The longest experiments in this series 
lasted 1 week, but because human cells continued to show strong 
fluorescence at this time, it is likely that expression would have 
continued for a longer period. In contrast, the number of mouse 
brain cells expressing GFP increased over time, so that after 1 
week all mouse cells were dead or green, suggesting viral repli- 
cation and de novo infection of additional generations of brain 
cells. In contrast to mouse brain cell cultures, there was no 
striking increase over time in the number of human brain cells 
that showed GFP expression, consistent with a lack of replication 
of the mCMV. Previous work with human CMV has demon- 
strated its presence in human astrocytes and astrocyte-related 
tumors (Duclos et al., 1989; Ho et al., 1991; McCarthy et al., 
1995). MC.55 (MOI = 2) also infected cultures of a cell line of 
human neuroblastoma cells (gift of Dr. K. Vives, Yale Universi- 
ty). These cells also showed strong levels of GFP expression (Fig. 
15D,£). 

Both of the experiments above were performed with human 
cells derived from a tumor and show abnormally high rates of cell 
division in vitro. Cells were also cultured from "normal" human 
cortex, removed surgically to provide access to a more ventral 
lesion. These cells replicated very slowly and appeared normal in 
culture. When these cultures were infected with mCM V (MOI = 
2), cells with a flat sheet-like morphology typical of astrocytes 
showed GFP expression indicating viral infection and gene ex- 
pression. In a typical culture well 2 weeks after low-density 
culturing of the human cells, and within 30 hr of CM V addition, 
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Figure 10. Intracerebral injections into mouse brain. mCMV injections were made into the lateral hypothalamus, with the syringe needle passing 
through and releasing virus into the ventricular system. A, The ventral region of the brain that includes the lateral hypothalamus (right of arrow). Scale 
bar, 65 jum. B t Same field as in A; shows GFP labeling of the meninges (arrow). C, Cells surrounding the blood vessels (arrows, bv) of the mammillary 
bodies after nearby injection of mCM V; the GFP label was stained with immunoperoxidase. /), Ependymal cells (arrows) of the lateral ventricle (lai V) 
are labeled (arrow) after immunoperoxidase staining of GFP. E, Some of the ependymal cells of the third ventricle (3rd V) show viral infection after 
x-gal staining for j8-gal. F, Many cells of the choroid plexus are labeled after injection of the 0-gal expressing mCM V. 



all 47 cells with the flat sheet-like morphology typical of astro- 
cytes showed GFP expression, indicating viral gene expression 
(Fig. 15F). At 2 weeks after infection, the longest interval studied, 
green cells were still found in these cultures. 

DISCUSSION 

CMV cell preference is relative, not absolute 

In the present study we found that all different types of cells in the 
brain can be infected with mCM V. This includes neurons, astro- 



cytes, ependymal cells lining the ventricles, ventricular tanycytes, 
radial glia, endothelial cells of the capillaries, ensheathing cells of 
the olfactory bulb, meninges, microglia, and cells of the choroid 
plexus. These data support the view that mCM V in the brain is 
an opportunistic virus, with no single cellular target. However, 
there was strong evidence that related subsets of cells showed a 
higher incidence of GFP expression, suggesting some viral pref- 
erence. An example of this was the high probability of striatal 
neuron infection coupled with the low incidence of infection in 
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Figure 11. Preferential infection of radial glia in developing mouse cortex. A, In mouse cortex, radial glia appear to be the primary cells infected 
after mCMV injections into the P2 mouse. Long processes travel from the cell body layer (arrow) and ascend to the outer cortical surface. The 
syringe needle passed through the cortex and into the hippocampus, and cells were labeled in both places. Scale bar, 50 pirn. B, In the same brain, 
near the midline, a number of processes cross the corpus callosum and continue through the developing gray matter. Scale bar, 15 /im. C, D, 
Processes of mCM V-infected radial glia in the deep cortex (C) and outer cortex (D) show dilations (small arrows) suggestive of pathology. Some 
dilations reach almost 10 jum in diameter. Other processes (long arrow) show no dilations. Scale bars: 20 /im. All micrographs here are after 
peroxidase immunostaining of GFP. 



glial cells in the same brain region, and the strong infection of 
cortical radial glia, discussed below. This could be attributable to 
differential cell-virus adhesion and cellular uptake, viral replica- 
tion, or differences in cellular activators of the promoters for the 
gene reporters used. Some neurotrophic viruses, for instance 
pseudorabies virus (Card et al., 1990, 1991, 1999) or herpes 
simplex-1 (Dash et al., 1996), may be selective for subsets of 
neurons. In the present study we found evidence for infection of 
various neuronal types from cortex, hippocampus, striatum, and 
hypothalamus. 



In our studies examining viral replication, mouse cultures en- 
riched in either astrocytes or neurons showed evidence of strong 
mCM V replication. Virus number peaked more rapidly in astro- 
cyte cultures than in neuronal cultures. This could be attributable 
in part to a greater level of replication in glial cells, although this 
was minimized by the use of defined medium not including serum 
and not favorable for astrocyte mitosis. Alternatively, although 
the cultures showed similar levels of confluency at the time of 
infection and were infected with equal amounts of virus, the glial 
cultures received a higher MOI, given the larger size of astrocytes 
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Figure 12. Mouse neuronal infection by mCMV. Neurons in the mouse brain were infected with mCMV as shown after irnmunoperoxidase staining 
of GFP in the cerebral cortex (A, scale bar, 12 /xm), hypothalamus (B, scale bar, 20 fxm), and hippocampus (C, scale bar, 25 /im). Axons show GFP label 
with no nearby neuronal cell body labeling, as shown in D (scale bar, 8 jum), E (scale bar, 5 /Am), and F (scale bar, 8 /xm). In E the GFP-ex pressing axons 
run parallel to the axis of many other axons running in the corpus callosum. 



compared with the small granule cell neurons. The larger mem- 
brane area of the cell body of astrocytes compared with neurons 
would also contribute to the increased probability of viral infec- 
tion. The more rapid expression of GFP in astrocytes than in 
neurons, coupled with the higher rate of replication in glia, is 
consistent with the concept that relative differences in viral in- 
fection exist. 



Strong viral expression was found in radial glia in the develop- 
ing mouse brain. One result of early CMV infection in human 
brains is the occurrence of microgyri (Diezel, 1954; Wolf and 
Cowen, 1972; Ho, 1991), a reduction in the size of cortical gyri. 
This has been previously attributed to vascular problems arising 
from CMV infection (Marques-Dias et al., 1984). However, we 
suggest another mechanism. Radial glia play an important role in 
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Figure 13. mCM V in neonatal mouse hippocampus. A, After intracere- 
bral injection into the P2 mouse, cells surrounding the hippocampus show 
strong evidence of infection 3 d later. Scale bar, 45 /mm. B, Higher 
magnification of the same brain shows labeling of some dendritic arbors 
(arrow) ventral to the pyramidal cell layer. Scale bar, 20 pun. 

serving as a pathway for the migration of neurons that are born in 
the ventricular zone and migrate outward along the radial glia 
in the developing rodent and primate brain (Rakic, 1978). Thus 
our data demonstrating CMV infection in radial glia in the 
developing brain suggest that after CMV-mediated radial glial 
degeneration, later outward neuronal migration may be blocked 
or misdirected. Thus microgyri or other anomalies of organiza- 
tion in CMV-infected human brains may hypothetically result 
from a selective loss of radial glia and subsequent abnormal 
neuronal migration. Further corroborating evidence is provided 
by studies in which mCMV was found to alter cortical neuronal 
migration in developing mice (Shinmura et al., 1997). Our finding 
of a high level of infection by mCMV in cells surrounding the 
hippocampus, with involvement of neurons within the hippocam- 
pus, may in some cases underlie hippocampal deficits, including 
epilepsy (Perez-Jiminez et al., 1998). 

CMV may enter the brain in several ways. Our data showing 
CMV infection of the endothelial cells of the vascular system is 
consistent with a previous report suggesting that blockade of the 
vascular system may cause brain deficits (Marques-Dias et al., 
1984; Wiley and Nelson, 1988). Another potential site of entry is 



the olfactory mucosa. Olfactory ensheathing cells that surround 
the olfactory nerve as it penetrates the brain can be infected with 
CMV. After intraperitoneal infection with CMV, the virus was 
detected in cells of the olfactory mucosa (Trgovcich et al., 1998), 
and intranasal administration leads to viral infection (Mannini 
and Medearis, 1961). The olfactory mucosa to the olfactory bulb 
has been suggested as one route of entry into the CNS for a 
number of viruses, including polio virus, semliki forest virus, and 
vesicular stomatitis virus (Johnson, 1998). Experimental injection 
of CMV in the guinea pig (Booss et al, 1989) or mouse (Tsutsui 
et al., 1995) peritoneal cavity results in brain infection a few days 
later. The finding of infection of vascular endothelial cells sug- 
gests that mCMV could use these cells as a potential means of 
entering or exiting the neuropil of the brain. CMV-infected 
lymphocytes may provide a means of facilitating entry into the 
brain. Another possibility for viral penetration into the brain may 
be through neuronal axons. Axonal transport is a critical mode of 
viral translocation for some other virus such as polio, rabies, and 
herpes simplex-1 (Johnson, 1998). Transport of the virus through 
processes would serve as a partial explanation for the selective 
GFP expression in striatal neurons. It is unlikely that the promot- 
ers used expressed reporter genes in only one cell type, because 
in culture all brain cells showed GFP expression after virus 
addition to the culture medium. 

Both in vivo and in vitro GFP-labeled axons showed severe 
dilations, and in culture, segments of labeled axons that had 
separated from the parent cell body. The lack of processes in later 
stages of infection of mouse neurons probably results from a 
combination of retraction of short processes and virally induced 
deterioration of the longer processes. Whether the virus may 
exert a direct and local effect on process deterioration or whether 
this process degeneration is secondary to infection in the cell 
body remains to be determined. 

mCMV-mediated gene transfer 

Although previous reports have suggested that mCMV does 
not infect human cells (Kim and Carp, 1971), by using the 
more sensitive recombinant mCMV described here we found 
robust GFP expression in human glial cells after infection. No 
obvious signs of cell death were found in mCM V-infected 
human cultures, suggesting that the mouse CMV may be a 
good candidate for further studies relating to use as a vector 
for gene introduction in human brain cells. A number of 
viruses, including herpes simplex-1, adenovirus, and adeno- 
associated virus, have been used for gene transfer in the CNS 
(Ho et al., 1995; Bartlett et al., 1998; Meier et al., 1998). Some 
viruses, for instance the Moloney murine leukemia retrovirus, 
may work well for gene transfer to dividing cells but work 
poorly in postmitotic neurons (Sena-Esteves et al., 1996). Al- 
though CMV has often provided its major immediate early 
gene promoter as a strong promoter of choice for expression of 
foreign genes in the brain, it has generally been adapted to 
other viruses such as adeno-associated virus (Lowenstein et 
al., 1996; Bartlett et al., 1998), and its expression from recom- 
binant CMV has not been previously evaluated. The large size 
of the CMV genome can be an advantage or disadvantage. On 
the positive side, large inserts can be engineered into mCMV, 
unlike some other viruses such as adeno-associated virus, 
which has a size restriction of a few kilobases. On the other 
hand, the CMV genome codes for >200 proteins the expres- 
sion of which might have consequences for the infected cell. 
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Figure 14. mCM V infects neurons of the rat brain. A, A medium-size spiny cell of the rat striatum is labeled 3 d after nearby mCMV administration 
and immunoperoxidase staining for GFP. Scale bar, 25 pim. B, Higher magnification of A shows high density of dendritic spines (arrows) on labeled cell. 
C, Axon in the midbrain, from the same brain as A-D, shows labeling. Scale bar, 8 /im. D, On an adjacent section to A, two more striatal neurons with 
their dendrites and axons show evidence of mCM V infection. Scale bar, 30 ixm. E, Neuron from cerebral cortex was infected with mCM V. Scale bar, 

20 fJLlYl. 



Molecular deletions can remove viral genes that may prove 
deleterious to the use of these viruses as vectors (Saederup et 
al., 1999). Future experimentation using replication competent 
or defective mCMV or hCMV should examine these 
possibilities. 

The fact that mCM V generated expression of GFP in neuronal 
cell bodies, dendrites, and axons suggests that these genetically 
engineered viruses may be useful as a tool for labeling neurons 
and their pathways. Labeled axons were found that extended 
thousands of micrometers away from perikarya near a zone of 
mCMV administration, indicating that GFP diffusion or trans- 
port is considerable. This is consistent with our previous work 
with transfection with plasmids coding for GFP under the regu- 
lation of the human major immediate early CMV promoter (van 



den Pol et al., 1998) or in transgenic mice expressing GFP under 
the control of a human CMV promoter (van den Pol and Ghosh, 
1998). Different sequences of the CMV promoter may result in 
reporter gene expression in different neurons of the CNS (Koe- 
dood et al., 1995; Baskar et al., 1996; Fritschy et al., 1996; van den 
Pol and Ghosh, 1998); differential activation of CMV promoters 
may serve as a partial explanation for why the virus may infect 
different regions of the brain and cause various neurological 
problems. With transgenic mice expressing reporter genes con- 
trolled by different regions of the immediate early promoter, 
neurons were commonly labeled, and labeled glia were rare; in 
contrast, in the present experiments we found that the mCMV 
retained the ability to express foreign genes in mouse, rat, and 
human glia and neurons. 
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Figure 15. Human brain cells are infected with mouse CMV./l-C, DIC, DIC + GFP, and GFP images of the same microscope field containing glial cells 
from a human astrocytoma are shown. A subset of cells shows strong GFP-mediated fluorescence. Scale bar, 10 yun. D, E y The same field shows the DIC 
image (D) and the GFP image (£). A subset of human neuroblastoma cells, 2 d after plating with mCMV, is infected with virus. Scale bar, 10 juun. f, 
"Normal" glial cells with an astrocyte morphology were cultured from human brain and show GFP expression 30 hr after infection. Scale bar, 10 /-on. 
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An amino acid sequence encodes a message that deter- 
mines the shape and function of a protein. This message is 
highly degenerate in that many different sequences can 
code for proteins with essentially the same structure and 
activity. Comparison of different sequences with similar 
messages can reveal key features of the code and improve 
understanding of how a protein folds and how it per- 
forms its function. 



THE GENOME IS MANIFEST LARGELY IN THE SET OF PRO- 
teins that it encodes. It is the ability of these proteins to fold 
into unique three-dimensional structures that allows them to 
function and carry out the instructions of the genome. Thus, 
comprehending the rules that relate amino acid sequence to struc- 
ture is fundamental to an understanding of biological processes. 
Because an amino acid sequence contains all of the information 
necessary to determine the structure of a protein (l) y it should be 
possible to predict structure from sequence, and subsequently to 
infer detailed aspects of function from the structure. However, both 
problems are extremely complex, and it seems unlikely that either 
will be solved in an exact manner in the near future. It may be 
possible to obtain approximate solutions by using experimental data 
to simplify the problem. In this article, we describe how an analysis 
of allowed amino acid substitutions in proteins can be used to 
reduce the complexity of sequences and reveal important aspects of 
structure and function. 



Methods for Studying Tolerance to 
Sequence Variation 

There are two main approaches to studying the tolerance of an 
amino acid sequence to change. The first method relies on the 
process of evolution, in which mutations are either accepted or 
rejected by natural selection. This method has been extremely 
powerful for proteins such as the globins or cytochromes, for which 
sequences from many different species are known (2-7). The second 
approach uses genetic methods to introduce amino acid changes at 



The authors arc in the Department of Biology, Massachusetts Institute of Technology, 
Cambridge, MA 02139. 

* Present address: Department of Chemistry and Biochemistry and the Molecular 
Biology Institute, University of California, Los Angeles, Los Angeles, CA 90024. 



specific positions in a cloned gene and uses selections or screens to 
identify functional sequences. This approach has been used to great 
advantage for proteins that can be expressed in bacteria or yeast, 
where the appropriate genetic manipulations are possible (J, 8-11). 
The end results of both methods are lists of active sequences that can 
be compared and analyzed to identify sequence features that are 
essential for folding or function. If a particular property of a side 
chain, such as charge or size, is important at a given position, only 
side chains that have the required property will be allowed. Con- 
versely, if the chemical identity of the side chain is unimportant, 
then many different substitutions will be permitted. 

Studies in which these methods were used have revealed that 
proteins are surprisingly tolerant of amino acid substitutions (2-4, 
11). For example, in studying the effects of approximately 1500 
single amino acid substitutions at 142 positions in lac repressor, 
Miller and co-workers found that about one-half of all substitutions 
were phenotypicaily silent (11). At some positions, many different, 
nonconservative substitutions were allowed. Such residue positions 
play litde or no role in structure and function. At other positions, no 
substitutions or only conservative substitutions were allowed. These 
residues are the most important for lac repressor activity. 

What roles do invariant and conserved side chains play in 
proteins? Residues that are directly involved in protein functions 
such as binding or catalysis will certainly be among the most 
conserved. For example, replacing the Asp in the catalytic triad of 
trypsin with Asn results in a 10 4 -fold reduction in activity (12). A 
similar loss of activity occurs in X repressor when a DNA binding 
residue is changed from Asn to Asp (13). To carry out their 
function, however, these catalytic residues and binding residues 
must be precisely oriented in three dimensions. Consequently, 
mutations in residues that are required for structure formation or 
stability can also have dramatic effects on activity (10 ; 14-16). 
Hence, many of the residues that are conserved in sets of related 
sequences play structural roles. 



Substitutions at Surface and Buried Positions 

In their initial comparisons of the globin sequences, Perutz and 
co-workers found that most buried residues require nonpolar side 
chains, whereas few features of surface side chains are generally 
conserved (6*). Similar results have been seen for a number of protein 
families (2, 4, 5, 7, 17, 18). An example of the sequence tolerance at 
surface versus buried sites can be seen in Fig. 1, which shows the 
allowed substitutions in X repressor at residue positions that are near 
the dimer interface but distant from the DNA binding surface of the 
protein (9). These substitutions were identified by a functional 
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Fig. 1. (A) Amino acid substitutions allowed in a 
short region of X repressor. The wild-type se- 
quence is shown along the center line. The al- 
lowed substitutions shown above each position 
were identified by randomly mutating one to 
three codons at a time by using a cassette method 
and applying a functional selection (9). (B) The 
fractional solvent accessibility (42) of the wild- 
type side chain in the protein dimcr (43) relative 
to the same atoms in an Ala-X-Ala model tripep- 
cide. 
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selection after cassette mutagenesis. A histogram of side chain 
solvent accessibility in the crystal structure of the dimer is also 
shown in Fig. 1. At six positions, only the wild-type residue or 
relatively conservative substitutions arc allowed. Five of these 
positions are buried in the protein. In contrast, most of the highly 
exposed positions tolerate a wide range of chemically different side 
chains, including hydrophilic and hydrophobic residues. Hence, it 
seems that most of the structural information in this region of the 
protein is carried by the residues that are solvent inaccessible. 



Constraints on Core Sequences 

Because core residue positions appear to be extremely important 
for protein folding or stability, we must understand the factors that 
dictate whether a given core sequence will be acceptable. In general, 
only hydrophobic or neutral residues are tolerated at buried sites in 
proteins, undoubtedly because of the large favorable contribution of 
the hydrophobic effect to protein stability (19). For example, Fig. 2 
shows the results of genetic studies used to investigate the substitu- 
tions allowed at residue positions that form the hydrophobic core of 
the NH 2 -terminal domain of X repressor (20). The acceptable core 
sequences are composed almost exclusively of Ala, Cys, Thr, Val, He, 
Leu, Met, and Phe. The acceptability of many different residues at 
each core position presumably reflects the fact that the hydrophobic 
effect, unlike hydrogen bonding, does not depend on specific 
residue pairings. Although it is possible to imagine a hypothetical 
core structure that is stabilized exclusively by residues forming 
hydrogen bonds and salt bridges, such a core would probably be 
difficult to construct because hydrogen bonds require pairing of 
donors and acceptors in an exact geometry. Thus the repertoire of 
possible structures diat use a polar core would probably be extreme- 
ly limited (21). Polar and charged residues are occasionally found in 
the cores of proteins, but only at positions where their hydrogen 
bonding needs can be satisfied (22). 

The cores of most proteins are quite closely packed (23), but some 
volume changes arc acceptable. In X repressor, the overall core 
volume of acceptable sequences can vary by about 10%. Changes at 
individual sites, however, can be considerably larger. For example, 
as shown in Fig. 2, both Phe and Ala are allowed at the same core 
position in the appropriate sequence contexts. Large volume 
changes at individual buried sites have also been observed in 



phylogenetic studies, where it has been noted that the size decreases 
and increases at interacting residues are not necessarily related in a 
simple complementary fashion (5, 7, 17). Rather, local volume 
changes are accommodated by conformational changes in nearby 
side chains and by a variety of backbone movements. 



The Informational Importance of the Core 

With occasional exceptions, the core must remain hydrophobic 
and maintain a reasonable packing density. However, since the core 
is composed of side chains that can assume only a limited number of 
conformations (24), efficient packing must be maintained without 
steric clashes. How important are hydrophobicity, volume, and 
steric complementarity in determining whether a given sequence can 
form an acceptable core? Each factor is essential in a physical sense, 
as a stable core is probably unable to tolerate unsatisfied hydrogen 
bonding groups, large holes, or steric overlaps (25). However, in an 
informational sense, these factors are not equivalent. For example, in 
experiments in which three core residues of X repressor were 
mutated simultaneously, volume was a relatively unimportant infor- 
mational constraint because three-quarters of all possible combina- 
tions of the 20 naturally occurring amino acids had volumes within 
the range tolerated in the core, and yet most of these sequences were 
unacceptable (20) . In contrast, of the sequences that contained only 



Fig. 2. Amino acid substitu- 
tions allowed in the core of X 
repressor. The wild-type side 
chains are shown pictorially in 
the approximate orientation 
seen in the crystal structure 
(43). The lists of allowed sub- 
stitutions at each position are 
shown below the wild-type 
side chains. These substitu- 
tions were identified by ran- 
domly mutating one to four 
residues at a time by using a 
cassette method and applying 
a functional selection (20) . 
Not all substitutions are al- 
lowed in every sequence back- 
ground. 
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the appropriate hydrophobic residues, a significant fraction were 
acceptable. Hence, the hydrophobicity of a sequence contains 
more information about its potential acceptability in the core than 
docs the total side chain volume. Steric compatibility was intermedi- 
ate between volume and hydrophobicity in informational impor- 
tance. 



Hie Informational Importance of Surface Sites 

We have noted that many surface sites can tolerate a wide variety 
of side chains, including hydrophilic and hydrophobic residues. This 
result might be taken to indicate that surface positions contain little 
structural information. However, Bashfbrd et a!., in an extensive 
analysis of globin sequences (4), found a strong bias against large 
hydrophobic residues at many surface positions. At one level, this 
may reflect constraints imposed by protein solubility, because large 
patches of hydrophobic surface residues would presumably lead to 
aggregation. At a more fundamental level, protein folding requires a 
partitioning between surface and buried positions. Consequently, to 
achieve a unique native state without significant competition from 
other conformations, it may be important that some sites have a 
decided preference for exterior rather than interior positions. As a 
result, many surface sites can accept hydrophobic residues individ- 
ually, but the surface as a whole can probably tolerate only a 
moderate number of hydrophobic side chains. 



Identification of Residue Roles from 
Sets of Sequences 

Often, a protein of interest is a member of a family of related 
sequences. What can we infer from the pattern of allowed substitu- 
tions at positions in sets of aligned sequences generated by genetic 
or phylogenetic methods? Residue positions that can accept a 
number of different side chains, including charged and highly polar 
residues, are almost certain to be on the protein surface. Residue 
positions that remain hydrophobic, whether variable or not, are 
likely to be buried within the structure. In Fig. 3, those residue 
positions in \ repressor that can accept hydrophilic side chains are 
shown in orange and those that cannot accept hydrophilic side 
chains are shown in green. The obligate hydrophobic positions 
define the core of the structure, whereas positions that can accept 
hydrophilic side chains define the surface. 

Functionally important residues should be conserved in sets of 
active sequences, but it is not possible to decide whether a side chain 
is functionally or structurally important just because it is invariant or 
conserved. To make this distinction requires an independent assay of 
protein folding. The ability of a mutant protein to maintain a stably 
folded structure can often be measured by biophysical techniques, 
by susceptibility to intracellular proteolysis (25), or by binding to 
antibodies specific for the native structure (27, 28). In the latter 
cases, it is possible to screen proteins in mutated clones for the 
ability to fold even if these proteins are inactive. Sets of sequences 
that allow formation of a stable structure can then be compared to 
the sets that allow both folding and function, with the active site or 
binding residues being those that are variable in the set of stable 
proteins but invariant in the set of functional proteins. The DNA- 
binding residues of Arc repressor were identified by this method (8). 
The receptor-binding residues of human growth hormone were also 
identified by comparing the stabilities and activities of a set of 
mutant sequences (28). However, in this case, the mutants were 
generated as hybrid sequences between growth hormone and related 
hormones with different binding specificities. 

U08 



Implications for Structure Prediction 

At present, the only reliable method for predicting a low- 
resolution tertiary structure of a new protein is by identifying 
sequence similarity to a protein whose structure is already known 
(29, 30). However, it is often difficult to align sequences as the level 
of sequence similarity decreases, and it is sometimes impossible to 
detect statistically significant sequence similarity between distantly 
related proteins. Because the number of known sequences is far 
greater than the number of known structures, it would be advanta- 
geous to increase the reach of the available structural information by 
improving methods for detecting distant sequence relations and for 
subsequently aligning these sequences based on structural principles. 
In a normal homology search, the sequence database is scanned with 
a single test sequence, and every residue must be weighted equally. 
However, some residues arc more important than others and should 
be weighted accordingly. Moreover, certain regions of the protein 
are more likely to contain gaps than others. Both kinds of informa- 
tion can be obtained from sequence sets, and several techniques have 




Fig. 3. Tolerance of positions in the NH r terminal domain of X repressor to 
hydrophilic side chains. The complex (43) of the repressor dimer (blue) and 
operator DNA (white) is shown. In (A), positions that can tolerate 
hydrophilic side chains arc shown in orange. The same side chains are shown 
in (B) without the remaining protein atoms. In (C), positions that require 
hydrophobic or neutral side chains are shown in green. These side chains are 
shown in (D) without the leniaining protein atoms. About three-fourths of 
the 92 side chains in the NH 2 -terminal domain are included in both (B) and 
(D). The rernaining positions have not been tested. Data are from (9, 14, 20, 
27, 44). 
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been used to combine such information into more appropriately 
weighted sequence searches and alignments (37). These methods 
were used to align the sequences of retroviral proteases with aspartic 
proteases, which in turn allowed construction of a three-dimension- 
al model for the protease of human immunodeficiency virus type 1 
(29), Comparison with the recently determined crystal structure of 
this protein revealed reasonable agreement in many areas of the 
predicted structure (32). 

The structural information at most surface sites is highly degener- 
ate. Except for functionally important residues, exterior positions 
seem to be important chiefly in maintaining a reasonably polar 
surface. The information contained in buried residues is also 
degenerate, the main requirement being that these residues remain 
hydrophobic. Thus, at its most basic level, the key structural 
message in an amino acid sequence may reside in its specific pattern 
of hydrophobic and hydrophilic residues. This is meant in an 
informational sense. Clearly, the precise structure and stability of a 
protein depends on a large number of detailed interactions. It is 
possible, however, that structural prediction at a more primitive 
level can be accomplished by concentrating on the most basic 
informational aspects of an amino acid sequence. For example, 
amphipathic patterns can be extracted from aligned sets of sequences 
and used, in some cases, to identify secondary structures. 

If a region of secondary structure is packed against the hydropho- 
bic core, a pattern of hydrophobic residues reflecting the periodicity 
of the secondary structure is expected (33, 34). These patterns can be 
obscured in individual sequences by hydrophobic residues on the 
protein surface. It is rare, however, for a surface position to remain 
hydrophobic over the course of evolution. Consequently, the am- 
phipathic patterns expected for simple secondary structures can be 
much clearer in a set of related sequences (6). This principle is 
illustrated in Fig. 4, which shows helical hydrophobic moment plots 
for the Antennapedia homeodomain sequence (Fig. 4A) and for a 
composite sequence derived from a set of homologous homeodo- 
main proteins (Fig. 4B) (35). The hydrophobic moment is a simple 
measure of the degree of amphipathic character of a sequence in a 
given secondary structure (34). The amphipathic character of the 
three a-helical regions in the Antennapedia protein (36) is clearly 
revealed only by the analysis of the combined set of homeodomain 
sequences. The secondary structure of Arc repressor, a small DNA- 
binding protein, was recently predicted by a similar method (8) and 
confirmed by nuclear magnetic resonance studies (37). 

The specific pattern of hydrophobic and hydrophilic residues in 
an amino acid sequence must limit the number of different structures 
a given sequence can adopt and may indeed define its overall fold. If 
this is true, then the arrangement of hydrophobic and hydrophilic 
residues should be a characteristic feature of a particular fold. Sweet 
and Eisenberg have shown that the correlation of the pattern of 
hydrophobicity between two protein sequences is a good criterion 
for their structural relatedness (38). In addition, several studies 
indicate that patterns of obligatory hydrophobic positions identified 
from aligned sequences are distinctive features of sequences that 
adopt the same structure (4, 29 f 38, 39). Thus, the order of 
hydrophobic and hydrophilic residues in a sequence may actually be 
sufficient information to determine the basic folding pattern of a 
protein sequence. 

Although the pattern of sequence hydrophobicity may be a 
characteristic feature of a particular fold, it is not yet clear how such 
patterns could be used for prediction of structure de novo. It is 
important to understand how patterns in sequence space can be 
related to structures in conformation space. Lau and Dill have 
approached this problem by studying the properties of simple 
sequences composed only of H (hydrophobic) and P (polar) groups 
on two-dimensional lattices (40). An example of such a representa- 



tion is shown in Fig. 5. Residues adjacent in the sequence must 
occupy adjacent squares on the lattice, and two residues cannot 
occupy the same space. Free energies of particular conformations arc 
evaluated with a single term, an attraction of H groups. By 
considering chains of ten residues, an exhaustive conformational 
search for all 1024 possible sequences of H and P residues was 
possible. For longer sequences only a representative fraction of the 
allowed sequence or conformation space could be explored. The 
significant results were as follows: (i) not all sequences can fold into 
a "native" structure and only a few sequences form a unique native 
structure; (ii) the probability that a sequence will adopt a unique 
native structure increases with chain length; and (iii) the native 
states are compact, contain a hydrophobic core surrounded by polar 
residues, and contain significant secondary structure. Although the 
gap between these two-dimensional simulations and three-dimen- 
sional structures is large, the use of simple rules and sequence 
representations yields results similar to those expected for real 
proteins. Three-dimensional lattice methods are also beginning to 
be developed and evaluated (41). 



Summary 

There is more information in a set of related sequences than in a 
single sequence. A number of practical applications Vise from an 
analysis of the tolerance of residue positions to change. First, such 
information permits the evaluation of a residue's importance to the 
function and stability of a protein. This ability to identify the 
essential elements of a protein sequence may improve our under- 
standing of the determinants of protein folding and stability as well 
as protein function. Second, patterns of tolerance to amino acid 
substitutions of varying hydrophilicity can help to identify residues 
likely to be buried in a protein structure and those likely to occupy 



Fig. 4. Helical hydro- 
phobic moments calcu- 
lated by using (A) the 
Antennapedia homeodo- 
main sequence or (B) a 
set of 39 aligned homeo- 
domain sequences (35). 
The bars indicate the ex- 
tent of the helical re- 
gions identified in nucle- 
ar magnetic resonance 
studies of the Antenna- 
pedia homeodomain 
(36). To determine hy- 
drophobic moments, 
residues were assigned 
to one of three groups: 
HI (high hydrophobici- 
ty = Trp, He, Phc, Leu, 
Met, Val, or Cys); H2 
(medium hydrophobic- 
ity = Tyr, Pro, Ala, Thr, 
His, Gly, or Scr); and H3 (low hydrophobicity = Gin, Asn, Glu, Asp, Lys, 
or Arg). For the aligned homeodomain sequences, the residues at each 
position were sorted by their hydrophobicity by using the scale of Fauchcrc 
and Pliska (45). Arg and Lys were not counted unless no other residue was 
found at the position, because they contain long aliphatic side chains and can 
thereby substitute for nonpolar residues at some buried sites. To account for 
possible sequence errors and rare exceptions, the most hydrophilic residue 
allowed at each position was discarded unless it was observed twice. The 
second most hydrophilic residue was then chosen to represent the hydropho- 
bicity of each position. An eight-residue window was used and the vectors 
projected radially every 100°. The vector magnitudes were assigned a value of 
1, 0, or - I for positions where the hydrophobicity group was HI, H2, or 
H3, respectively. 
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Fig. 5. A representation of one com- 
pact conformation for a particular 
sequence of H and P residues on a 
two-dimensional square lattice. 
[Adapted from (40), with permis- 
sion of the American Chemical Soci- 
ety] 



surface positions. The amphipathic patterns that emerge can be used 
to identify probable regions of secondary structure. Third, incorpo- 
rating a knowledge of allowed substitutions can improve the ability 
to detect and align distantly related proteins because the essential 
residues can be given prominence in the alignment scoring. 

As more sequences are determined, it becomes increasingly likely 
that a protein of interest is a member of a family of related 
sequences. If this is not the case, it is now possible to use generic 
methods to generate lists of allowed amino acid substitutions. 
Consequendy, at least in the short term, it may not be necessary to 
solve the folding problem for individual protein sequences. Instead, 
information from sequence sets could be used. Perhaps by simplify- 
ing sequence space through the identification of key residues, and by 
simplifying conformation space as in the lattice methods, it will be 
possible to develop algorithms to generate a limited number of trial 
structures. These trial structures could then, in turn, be evaluated by 
further experiments and more sophisticated energy calculations. 



REFERENCES AND NOTES 

1. CI. Epstein, R. F. Goldberger, C B. Anfinscn, Cold Spring Harbor Symp. Quant. 
Biol. 28, 439 (1963); C B. Anfinscn, Sdmce 181, 223 (1973). 

2. R. E. Dickerson, Sci. Am. 242, 136 (March 1980). 

3. M. D. Hampsey, G. Das, F. Sherman, FEBS Lett. 231, 275 (1988). 

4. D. Bashford, C Chothia, A. M. Lesk, J. Mol. Biol. 196, 199 (1987). 

5. A. M. Lcsk and C. Chothia, ibid. 136, 225 (1980). 

6. M. F. Perutz, J. C Kendrew, H. C Watson, ibid. 13, 669 (1965), 

7. C Chothia and A. M. Lesk, Cold Spring Harbor Symp. Quant. Biol. 52, 399 (1965). 

8. J. U. Bowie and R. T. Sauer, Proc. Natl. Acad. Sci. U.S.A. 86, 2152 (1989). 

9. J. F. Rcidhaar-Olson and R. T. Sauer, Science 241, 53 (1988); Proteins Struct. Ftmct. 
Genet. , in press. 

10. D. Shortle, J. Biol Chem. 264, 5315 (1989). 

11. J. H. Miller ex al, J. Mol. Biol. 131, 191 (1979). 



12. S. Sprang et at., Science 237, 905 (1987); C S- Craik, S. Roaniak, C Largman, W. 
J. Rutter, ibid., p. 909. 

13. H. C M. Nelson and R. T, Sauer, J. Mot. Biol. 192, 27 (1986). 

14. M. H. Hccht, J. M. Sturtevant, R. T. Sauer, Proc. Natl. Acad. Sci. U.S.A. 81, 5685 
(1984). 

15. T. Albcr, D. Sun, J, A. Nye, D. C Muchmore, B. W. Matthews, Biochemistry 26, 
3754(1987). 

16. D. Shortle and A. K. Meeker, Proteins Struct. Fund. Genet. 1, 81 (1986). 

17. A. M. Lesk and C. Chothia, J. Mol. Biol. 160, 325 (1982). 

18. W. R. Taylor, ibid. 188, 233 (1986). 

19. W. Kauzmann, Adv. Protein Chem. 14, 1 (1959); R. L. Baldwin, Proc. Natl. Acad. 
Sci. U.S.d.83, 8069(1986). 

20. W. A. Lim and R. T. Sauer, Naturt 339, 31 (1989); in preparation. 

21. Lesk and Chothia (5) have argued that a protein core composed solely of hydrogen- 
bonded residues would also be inviablc on evolutionary grounds, as a mutational 
change in one core residue would require compensating changes in any interacting 
residue or residues to maintain a stable structure. 

22. T. M. Gray and B. W. Matthews, J. Mol. Biol. 175, 75 (1984); E. N. Baker and R. 
E. Hubbard, Prog. Biophys. Mol. Biol. 44, 97 (1984). 

23. F. M. Richards, J. Mol. Biol. 82, 1 (1974). 

24. J. W. Ponder and F. M. Richards, ibid. 193, 775 (1987). 

25. J. T. Kellis, Jr., K. Nyberg, A. R. Fcrsht, Biochemistry 28, 4914 (1989); W. S. 
Sandberg and T. C Tcrwilliger, Science 245, 54 (1989). 

26. A. A. Pakula and R. T. Sauer, Proteins Struct. Funct. Genet. 5, 202 (1989). 

27. B. C Cunningham and J. A. Wells, Science 244, 1081 (1989); R. M. Brcycrand R. 
T. Sauer, J. Biol. Chem. 264, 13348 (1989). 

28. B. C Cunningham, P. Jhurani, P. Ng, J. A. Wdls, Science 243, 1330 (1989). 

29. L. H. Pearl and W. R. Taylor, Nature 329, 351 (1987). 

30. W.^Brownrt(I/.,/Mo^BI■o^42 > 65(1969);^Greer,lm 153, 1027 (1981); J. 
M. Berg, Proc. Natl. Acad. Sci. U.S.A. 85, 99 (1988). 

31. W. R. Taylor, Protein Eng. 2, 77 (1988). 

32. M. A. Navia et at., Nautre 337, 615 (1989). 

33. M. Schiffer and A. B. Edmundson, Biophys. J. 7, 121 (1967); V. I. Lim, J. Mol. 
Biol. 88, 857 (1974); ibid., p. 873. 

34. D. Eisenberg, R. M. Weiss, T. C TerwiUiger, Nature 299, 371 (1982); D. 
Eiscnbcrg, D. Schwarz, M. Komaromy, R. Wall, Mol. Biol. 179, 125 (1984); D. 
Eisenberg, R. M. Weiss, T. C. Tcrwilliger, Proc. Natl. Acad. Sci. U.S.A. 81, 140 
(1984). 

35. T. R. Burglin, Cell 53, 339 (1988). 

36. G. Otting et at., EMBO J. 7, 4305 (1988). 

37. J. N. Brcg, R. Boelcns, A. V. E. George, R. Kaptein, Biochemistry 28, 9826 (1989); 
M. G. Zagorski, J. U. Bowie, A. K. Vershon, R. T. Sauer, D. J. Patcl, ibid., p. 
9813. 

38. R. M. Sweet and D. Eisenberg, J. Mol. Biol. 171, 479 (1983). 

39. J, U. Bowie, N. D. Clarke, C O. Pabo, R. T. Sauer, Proteins Struct. Ftmct. Genet., in 
preparation. 

40. K. F. Lau and K. A. Dill, Macromolecutcs 22, 3986 (1989). 

41. A. Sikorski and J. Skolnick, Proc. Natl. Acad. Sci. U.S.A. 86, 2668 (1989); A. 
Kolinski, J. Skolnick, R. Yaris, Biopolymers 26, 937 (1987); D. G. CoveU and R. L. 
Jcmigan, Biochemistry, in press. 

42. B. Lee and F. M. Richards, J. Mol. Biol. 55, 379 (1971). 

43. S. R. Jordan and C O. Pabo, Science 242, 893 (1988). 

44. R. M. Breyer, thesis, Massachusetts Institute of Technology, Cambridge (1988). 

45. J.-L. Fauchere and V. Pliska, Eur. J. Med. Chem.-Chim. Ther. 18, 369 (1983). 

46. We thank C. O. Pabo and S. Jordan for coordinates of the NHj-terminal domain of 
A repressor and its operator complex. We also thank P. Schimmd for the use of his 
graphics system and J. Burnbaum and C. Francklyn for assistance. Supported in 
part by NIH grant AI- 15706 and predoctoral grants from NSF (J.R.-O.) and 
Howard Hughes Medical Institute (W.A.L.). 



HIO 



SCIENCE, VOL. 24-7 



American 

Association for the 
Advancement of 
Science 



SdENCE 



issn 0036-8075 
16 March 1990 
Volume 247 
Number 4948 



1271 This Week in Science 



Editorial 



Letters 



INiews? & Comment 



Research i\c\vs 



Article 



Research Article 



Report? 



1273 Support for Materials Science and Engineering 



1280 U.S. Oil and Gas Consumption: Is Another Crisis Ahead?: D. B. Hawkins; 
R. L. Hirsch ■ Punitive Damages and Innovation: D. H. Kaye; A. Brown; 
R. L. Randall; R. J. Mahoney and S. E. Little jo hn 

1283 SDI Heads for Fiscal Crash ■ With a Little Help ■ Army, Air Force Eye SDI 
Spinoffs ■ The Ups and Downs of Strategic Defense Weapons 

1286 In Peru, Even Potato Research Is High Risk 

1287 Gene Therapy Clears First Hurdle 

1288 Greenpeace and Science: Oil and Water? 

1290 Academy of Engineering Elects New Members 

1291 Particle Physicists Look to the Heavens 

1294 Heart Like a Wheel 

1295 Tweaking Molecules with Laser Light 

1296 CF Screening Delayed for a While, Possibly Forever 

1297 Ozone Destruction Closer to Home 

1298 Briefings: Prospectors Cash In on Geologist's Find ■ Transgenic Carp: Pond- 
Ready? ■ Technology Unlocks "Lost" Writings ■ Stanford Patents Pay 

1299 Drug Prevention in Junior High: A Multi-Site Longitudinal Test: 
P. L. Ellickson and R. M. Bell 

1306 Deciphering the Message in Protein Sequences: Tolerance to Amino Acid 

Substitutions: J. U. Bowie, J. F. Reidhaar-Olson, W. A. Lim, R. T. Sauer 

1311 Deep Magma Body Beneath the Summit and Rift Zones of Kilauea Volcano, 

Hawaii: P. T. Delaney, R. S. Fiske, A. Miklius, A. T. Okamura, M. K. Sako 

1317 Femtosecond Pulse Sequences Used for Optical Manipulation of Molecular 

Motion: A. M. Weiner, D. E. Leaird, G. P. Wiederrecht, K. A. Nelson 



■ SCIENCE is published weekly on Friday, except the last week In December, end with e supplement In March by 
ttw American Association for the Advancement of Science, 1333 H Street, NW, Washington, DC 20005. Second- 
class Non-profit postage (publication No. 484460) paid at Washington, DC, and at an additional entry. Copyright r> 1990 
by the American Association for the Advancement of Science. Trie tme SCIENCE is a registered trademark of the AAAS 
Domestic individual rrwmbership and subscription (51 issues): $75. Domestic Institutional subscription (51 issues): $120. 
Foreign postage extra: Canada $46, other (surface mall) $46. air mail via Amsterdam $85. Rrst class, airmail, school 
year, and student rates on request Single copy sales: Current Issue, $3.50; back Issues. $5.00; Biotechnology ismw. 
$6.00 (for postage and handling, add per copy $0.50 U.S., $1.00 ail foreign); Guide to Biotechnology Products nnd 
Instruments, $18 (tor postage and handling add per copy $1.00 U.S.. $1.50 Canada, $2.00 other foreign). Bulk mi*, m. 
request Authorization to photocopy materia] tor Internal or personal use under drcumstances not falling within it » u» 
use provisions of the Copyright Act is granted by AAAS to libraries and other users registered with the Cofiykj»t 
Clearance Center (CCC) Transactional Reporting Service, provided that the base tee of $1 per copy plus $0 in i** i«oo 
is paid directly to CCC. 27 Congress Street, Salem. Massachusetts 01970. The identification code for .<m:k>/**> i» now 
8075/83 $1 + .10. Change of address: allow 6 weeks, giving old and new addresses and 11-digit account numtMi. 
Postmaster: Send Form 3579 to Sctence, P.O. Box 1723, Rlverton, NJ 08077. Science is Indexed tn om tu*<u» * 
Guide to Periodica} Literature and fei several specialized indexes. 

■ The American Association for the Advancement of Science was founded in 1848 and incorporated in 1874. Ha objectives 
are to further the work of scientists, to facilitate cooperation among them, to foster scientific freedom and responsibility, 
to improve the efectiveness of science in the promotion of human welfare, to advance education in science, end to 
Increase public understanding and appreciation of the importance and promise of the methods of science in human 
progress. 



1268 



SCIENCE, VOL. 247 




cover Basalt lava erupted from Kilauea Volcano, Hawaii, pours 3 meters from 
a shallow tube into the Pacific Ocean. The current eruption began in January 1983- 
the lava produced now covers 70 square kilometers, including 150 acres of new land 
formed along the seacoast. Analysis of more than 20 years of ground deformation 
data suggests that Kilauea's magma system is deeper and more extensive than 
previously thought. See page 1311. [Photographed on 27 November 1989 by J. D 
Griggs, U.S. Geological Survey] 
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